Preprint typeset in JHEP style - HYPER VERSION 

Searching for systematics in SNIa and galaxy 
cluster data using the cosmic duality relation 



Arman Shafieloo 

Asia Pacific Center for Theoretical Physics, Pohang, Gyeongbuk 790-784, Korea 
k 

Department of Physics, POSTECH, Pohang, Gyeongbuk 790-784, Korea 
E-mail: arman@apctp . org 

Subhabrata Majumdar 

Tata Institute for Foundamental Research, Mumbai, India 
-E'-maiZ.'subhaStif r . res . in 

Varun Sahni 

Inter- University Centre for Astronomy and Astrophysics, Post Bag 4, 
Ganeshkhind, Pune 411 007, India 
E-mail: varun@iucaa. ernet . in 

Alexei A. Starobinsky 

Landau Institute for Theoretical Physics RAS, Moscow 119334, Russia 
& 

Research Center for the Early Universe (RESCEU), Graduate School of Science, 
The University of Tokyo, Tokyo 113-0033, Japan 
E-mail: alstar@landau.ac.ru 

Abstract: We compare two different probes of the expansion history of the uni- 
verse, namely, luminosity distances from type la supernovae and angular diameter 
distances from galaxy clusters, using the Bayesian interpretation of Crossing statistic 
[1,2] in conjunction with the cosmic duality relation. Our analysis is conducted in- 
dependently of any a-priori assumptions about the nature of dark energy. The model 
independent method which we invoke searches for inconsistencies between SNIa and 
galaxy cluster data sets. If detected such an inconsistency would imply the pres- 
ence of systematics in either of the two data sets. Simulating observations based 
on expected JDEM supernovae data and X-ray eROSITA + SZ Planck cluster data, 
we show that our method allows one to detect systematics with high precision and 
without advancing any hypothesis about the nature of dark energy. 
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1. Introduction 

A remarkable property of our universe is that is seems to be accelerating during the 
present epoch [3] . The precise cause of cosmic acceleration is presently unknown but 
might rest in the presence of the cosmological constant or some other form of 'Dark 
Energy' (DE) capable of violating the strong energy condition. Alternatively, cosmic 
acceleration might arise due to modifications to the gravitational sector of the theory, 
such as f{R) gravity, Braneworld models, etc. However, as the simple example of 
a non-minimally coupled scalar field shows, these two possibilities are only extreme 
particular cases of a more general notion of DE both including new non-gravitational 
physical fields and modifying gravity, too. Many of these possibilities have been 
extensively examined in recent years [4] but firm conclusions as to the nature of 
dark energy are still to be drawn. One reason for this is that current observational 
data sets, despite steady improvement, are still hampered by uncertainties both of a 
statistical as well as systematic nature [5]. 

Indeed, in order for firm and robust conclusions to be drawn about the nature 
of DE one will need to (a) minimize statistical uncertainties by increasing the depth 
and quality of observational data sets, (b) understand (and model better) the nature 
of systematics in the different kinds of data sets used to explore cosmic acceleration. 

The main evidence for cosmic acceleration currently comes from two types of 
data sets: 

(i) Those probing the luminosity distance, cIl, by observing the flux, J 7 , of type 
la supernovae of given luminosity L through 



where 

HqcIl f dz H(z 



, , j. . • Kz) = (1.2) 

in a spatially flat universe where H(t) = ^ and a(t) is the FRW scale factor, 
(ii) Those based on the angular diameter distance, d A to a source of spatial size 
d via the relation 

A* = A. (L3 ) 

Remarkably, for a wide range of cosmological models, the two distances are 
related through the cosmic duality relation (CDR) 

d L = (1 + zfd A . (1.4) 

In recent years numerous studies have devoted themselves to establishing whether 
or not the CDR holds in practice [6] . The reason for this largely stems from the hope 
that a violation of the equality in (1.4) might signal the presence of new physics. Such 
a violation may occur, for instance, through photon number non-conservation either 
through axion-photon mixing [7], or because of photon absorption enroute to the 
observer [8], or due to an incorrect modelling of the ultra-narrow beams from point 
sources such as type la supernovae [9] . 

However, since (1.4) follows simply: (i) from the requirement that sources and 
observers be connected via null geodesies in a Riemannian space-time, (ii) the phase- 
space conservation of photons; therefore the CDR remains valid for a very wide class 
of spatially homogeneous (and even inhomogeneous) cosmologies [10, 11]. Therefore 
it could well be that the cosmic duality relation is an exact principle in nature. If this 
is indeed the case, then a violation of the CDR would no longer imply new physics, 
but would signal instead to the presence of hitherto undetected systematics in data 
sets relating either to dL or to d& (or to both). Since the actual cosmological model 
of the present Universe is not known and there exist many models of dark energy 
which provide an alternative to an exact cosmological constant, it is interesting to 
investigate if one can compare different observational data and look for systematics in 
them without making any theoretical assumptions regarding the cosmological model. 

The purpose of the present paper is to show how the presence of a systematic 
increase in the value of the cosmological distance modulus (as an observable) 

/i = m-M = 51og 10 d L + 25 (1.5) 

shows itself as an apparent violation of the cosmic duality relation. In this 
paper we use the idea of the Bayesian interpretation of Crossing Statistic [1,2,12] 
along with the smoothing method [13-15] to compare cosmic distances from different 
data sets in a purely model independent manner. We shall show that even a small 
change in /i can be detected using the CDR and cosmological data sets which shall 
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soon become available. Using this approach we are able in fact to disclose differences 
between the two data sets without a need to make any assumptions about the nature 
of dark energy or to set any priors on cosmological parameters. 

In the following, we first discuss the method that we use to compare different data 
sets. The smoothing method and the Bayesian interpretation of Crossing statistic 
will be discussed briefly and we shall explain how we combine these methods to look 
for consistency between different data sets. This idea has been discussed briefly in 
[12] and in this work we explore the application of this method in greater detail 
and in a different context. In this approach we use the smoothing method together 
with the Crossing Statistic to reconstruct the cosmic distances from the two different 
datasets and set confidence limits on the reconstructed Crossing hyperparameters. 
In the absence of systematics in either of the data sets, the confidence limits of the 
Crossing hyperparameters should have a reasonable overlap. On the other hand, 
if the Crossing hyperparameters do not overlap nicely, then this would imply an 
inconsistency between data sets which could be interpreted as the existence of some 
sort of systematics. We shall apply our method on future simulated data where in 
one case we assume that there are no systematics in the data and in the other we 
assume that there exists some form of systematics. Finally, we present our results 
and show how our method can discern the presence of systematics without any prior 
assumption on the cosmological model. 

2. Method and Analysis 
2.1 Smoothing Method 

The smoothing method is a completely model independent approach to derive the 
di{z) relation directly from the data, without any assumptions other than the intro- 
duction of a smoothing scale. The only parameter used in the smoothing method is 
the smoothing width A, which is constrained only by the quality and quantity of the 
data, and has nothing to do with any cosmological model. The smoothing method 
is an iterative procedure with each iteration typically giving a better fit to the data. 
It has been shown in [13-15] that the final reconstructed results are independent of 
the assumed initial guess, d^Zi) 9 . 

The modified smoothing method (error-sensitive) can be summarised by the fol- 
lowing equation [15]: 



lnrf L (z,A) s = In d L (z) 9 
, ^ [lnd L ( Zi ) - In d L ( Zi y\ 
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where di(z) is the data, N(z) is the normalization factor, d^izi) 9 is the initial guess 
model and A is the width of smoothing. 

The absolute brightness of type la supernovae is degenerate with H since the ob- 
served quantity is the distance modulus fJ>(z). The outcome of the smoothing method 
is therefore H d L (z)/c = d r ^ c (z) = (1 + z)D(z). In this paper we set A = 0.30 which 
is similar to the value used in [15]. Complete explanation of the relations between A, 
the number of data points, quality of the data and results of the reconstruction ex- 
ercise can be found in [13, 14]. It has earlier been shown that the smoothing method 
is a promising approach to reconstruct the expansion history of the universe. How- 
ever, setting confidence limits has been an issue and in earlier work the bootstrap 
approach was used to set the confidence limits. In this paper, the reconstructed form 
of (1l(z) will be used as a mean function in the full reconstruction process [12] which 
includes the idea of Bayesian interpretation of Crossing Statistic as explained in the 
next section. 



2.2 Reconstructing the Expansion History of the Universe using the Cross- 
ing Statistic 

The main idea behind the crossing statistic lies in the fact that the actual model 
of the universe and the reconstruction using smoothing would have one or two mild 
crossings: the distance modulus fiducial of the fiducial cosmological model and the 
reconstructed fj,(z) smooth would cross each other at one or two points in the redshift 
range defined by the data [1]. 

Furthermore, in a FRW universe the distance modulus monotonically increases 
with redshift. Consequently any two cosmological models become virtually indistin- 
guishable if the distance modulus of one of them is multiplied by a suitable function 
of the redshift. The coefficients of this function (multiplying /i) constitute the Cross- 
ing hyperparameters and the functions themselves will be called Crossing functions 
following [2]. In our case the Crossing functions multiply the smoothed distance 
modulus reconstructed from the data using the method described in the previous 
section. We shall refer to the smoothed functions as the mean functions since they 
accurately describe the mean value of the function that is being smoothed, which in 
this particular case is [i. From the preceding argument, and that in [2], it is clear that 
the crossing functions, multiplied by the mean functions, should virtually coincide 
with the actual model of the universe. 

The reconstruction of the expansion history of the universe using Bayesian in- 
terpretation of Crossing statistic is therefore a combination of a non-parametric 
(smoothing) method with a parametric approach (using a Crossing function) to de- 
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fine and set the confidence limits on the cosmic expansion history. The crossing 
function is defined by Chebyshev polynomials [2, 12]: 



T n {C u C 2 , z) = l + Ci(— ) + C 2 [2( — ) 2 - 1], (2.2) 

Zmax Zmax 

and we fit 

f4mooth( Z ) = VSmoothiz) X T n (C U C 2 , z) (2.3) 

to the data and find the best fit point C\ est , C| es * in the hyperparameter space as 
well as the Ci,C 2 parameters related to the la, 2<j and 3a confident limits. Each 
Tji(Ci,C 2 , z) (where Ci,C 2 pairs are within hyperparameter confidence contours) 
multiplied by Hsmooth{z) represents a reconstruction of the expansion history of the 
universe consistent with the data. 

It has been shown that this method works very well in reconstructing the expan- 
sion history of the universe and in determining cosmological quantities such as the 
Hubble parameter h(z), Om diagnostic Om(z) and the deceleration parameter q(z) 
[12], since by reconstructing cIl(z) one can derive h(z), Om(z) and q(z) using: 



H(z) 



d ( diXz) 



dz \1 + z 



(2.4) 



^) = (l + ^)f||-l (2.5) 
h 2 (z) - 1 

where derivatives are respect to the redshift and h(z) = H(z)/H . Its important to 
note that in applying this method to derive the cosmological quantities in (2.6) one 
does not require a prior knowledge of the matter density. In fact Om(z) can be used 
to falsify the ACDM model without any prior knowledge of Qo m since Om(z) is a 
constant at all redshifts in ACDM [16, 17]. 

2.3 Comparing Data Sets 

We shall apply our method to compare the consistency of the luminosity distance 
(obtained from type la supernovae) and the angular size distance (obtained from X- 
ray and SZ clusters). To compare these two data sets one need not reconstruct h(z), 
Om(z), etc., instead one can simply compare the confidence limits of the Crossing 
hyperparameters, C\ and C 2 , derived from these different data sets. In order to 
perform this comparison, we first apply the smoothing method on supernovae data to 
derive Hsmooth{z) and then use this fJ*smooth{ z ) to fit both luminosity distance data from 
supernovae as well as angular diameter distance data from X-ray and SZ clusters. 
This is done using Chebyshev Crossing functions in (2.2) so that one derives the 
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Crossing hyperparameters in each case. In other words, after deriving fismooth(z) by 
smoothing supernovae data, we fit supernovae data using (2.3) and place constraints 
on Cf N and Cl N . Since the mean function is derived from supernovae data itself we 
expect the confidence limits of Cf N and Cf N to be centred around the (0, 0) point 
in the hyperparameter space. 

Next we again use (2.3) (with the same fismooth( z ) from supernovae data) but 
replace supernovae data by cluster data to determine C^ luster and C^ luster ■ If the 
confidence contour of Ci luster and (7^ uster has a significant overlap with the confi- 
dence contour of Cf N and C^ N , then we can conclude that the two data sets (SNIa 
& Cluster) are in concordance with each other. Otherwise we are observing an incon- 
sistency most probably due to the presence of systematics in either of the data sets. 
The reader should note that in fitting both supernovae and cluster data, and to deter- 
mine crossing hyperparameters, we use the same smooth mean function, Hsmooth(z) 
(Above it was derived from supernovae data). In fact the crossing hyperparameters 
in each case represent the deviation from the mean function suggested by the data. 
If two data sets imply different deviations from a given mean function, ^smooth(z), 
then this might suggest an inconsistency between the different data sets. 

We apply this method to a simulated future SNIa data set assumed to have 
about 2300 supernovae with 0.015 < z < 1.7 and <7j n j = 0.13, and consistent with 
expectations from the JDEM survey [18]. For our second data set we focus on the 
reasonably good angular diameter distance data expected from future SZ and X-ray 
surveys. Current and upcoming surveys in both radio (SZ), like SPT, ACT or Planck, 
and X-ray, like eROSITA would have large patches of the observed sky in common 
and hence would be expected to have a common ensemble of clusters. If a subset of 
these clusters have measured temperatures, which will naturally come about during 
follow-up mass calibrations, then this cluster ensemble can be used to estimate the 
angular diameter distance to these clusters. The cIa{z) constructed from this dataset 
will therefore provide a complementary probe of cosmology to (1l(z) from SNIa. 
We construct mock catalogs of such (1a{z) measurements to see the implications for 
cosmology and systematics. For this purpose, we use clusters that will be potentially 
found in two all sky surveys, Planck (which has already started discovering clusters 
using their SZ signatures), and eROSITA (which is the upcoming X-ray survey to 
be launched in 2014). For Planck, we assume that clusters will be detected over a 
limiting flux of 300 mJy (at 353 GHz), which will return close to ~ 2000 clusters in 
~ 32000 deg 2 of the sky. The higher flux limit means that Planck will detect only 
the high mass nearby clusters. eROSITA, which is a deeper survey, is modelled to 
observe clusters over a flux limit of 4 x 10~ 14 erg cm _2 s _1 in the [0.5 — 2.0 keV] band. 
It will detect ~ 10 5 clusters over ~ 32000 deg 2 . All the clusters detected by Planck 
will be seen by eROSITA. 

The clusters that will be jointly found in these two surveys, and hence the con- 
structed cLa(z), will depend on the underlying flux-limited cluster redshift distribu- 
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tion dN/dz of these surveys , as well as the redshift distribution of the subset for 
which the temperature can be measured. We begin by, first, constructing the cluster 
dN/dz using the corresponding mass-limit, Mu m (z), to the above flux-limits. For de- 
tails see [19-21]. In our analysis, we select clusters that lie above the highest of the 
Mu m (z) from either of the surveys. This gives the first subset of common clusters. 
Next, using the fact that we need roughly ten times the photons to measure X-ray 
temperatures, we calculate the corresponding Mn m (z) for ten times the detectable 
flux. This gives a further subset to the one already realised. Finally, we chose a final 
subset from these clusters, namely only those that will be large enough to be fit by 
a simple /3-model; this is possible if the cluster core radius subtends an angular size 
at the cluster redshift which is at least few times the beam size (typically 2-3 times 
the eROSITA beam which we take to be ~ 16 arc-sec). The corresponding isophotal 
size is converted to a limiting mass using cluster scaling relations. The redshift-cf^z) 
catalog is constructed by integrating the cluster mass function (for our fiducial cos- 
mology) over the highest of these limiting masses. This gives the number of clusters 
for which we would be able to estimate dj^{z). Finally, the clusters are distributed 
randomly with a Gaussian scatter of 25% about the (Ia(z) from the fiducial cosmol- 
ogy. Since, typically, we expect to measure (1a(z) in the future with better than 20% 
accuracy, we put a conservative error of 20% on each measure of (Ia{z). For our mock 
(Ia{z) catalog we get ~ 1830 clusters between 0.1 < z < 1.1 for a fiducial ACDM 
cosmology. At the end of this paper we also use a fiducial DE model with w = —0.8 
to cross check our proposed method and in that case get more than 2000 clusters up 
to redshifts of 1.2. 

Note that cluster data has the form of dA(z)H /c where H is the Hubble constant 
and c is the speed of light, whereas Supernovae data involves the distance modulus 
(1.5) which is directly related to di,{z) in (1.2). In this context it is useful to note 
that while: (i) H Q acts as a nuisance parameter when dealing with supernovae data 
since it is degenerate with the latter's brightness, (ii) H is needed to derive cIa(z) 
from cluster data and this adds an additional element of uncertainty to the analysis. 

• In the first stage of our analysis we assume that there are no systematics in 
the data and that the value of the Hubble parameter is perfectly known. In 
simulating the data we assume a flat ACDM model with Qo m = 0.26 and 
H = 70km/ sec/Mpc and we apply our method of comparison to this fiducial 
dark energy model. Fig.l shows the contours of the Crossing hyperparameters 
derived from SNIa and Cluster data and one sees that they have a proper 
overlap. Note that while we have assumed ACDM to be our fiducial model in 
simulating the data, similar results would have been obtained using any other 
fiducial cosmology to generate data. Therefore the results of fig.l are essentially 
model independent. We have shown this at a later part of this paper where 
we did our analysis for another dark energy model with w = —0.8 for better 
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Figure 1: Confidence contours of the Crossing hyperparameters using simulated future 
supernovae data from JDEM survey (la CL in magenta, 2a CL in blue) and simulated 
future (SZ + X-ray) angular diameter distance data (la CL in green, 2a CL in red). The 
two data sets are clearly consistent in the absence of systematics in either of the data. 
Note that the simulated data are based on a flat ACDM model with Qom = 0.266 (we 
have to assume a model to simulate the data in any case) but in our analysis and in the 
determination of these confidence contours we have not assumed any cosmological model 
which make our method model independent. 

clarification. 

• Next we assume that the supernovae data set contains systematics which re- 
sult in the following redshift dependent increase in the value of the distance 



where a quantifies the amount of systematics. The left panel of fig. 2 shows a 
comparison between the two data sets after assuming a = 0.01 for the system- 
atics. One clearly sees that the two contours are far apart which is indicative 
of a strong inconsistency between the SNIa and Cluster data sets. In the right 
panel of fig. 2 the systematics have been reduced five-fold to a = 0.002. Looking 
at this figure we notice that the two data sets are still somewhat inconsistent 
since the la confidence contours do not intersect. Note that this is hardly 
the case if one visualises the data themselves, which is what we have done in 
fig. 3 in which the data sets corresponding to a = (left panel) and a = 0.002 



modulus: 




(2.7) 
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Figure 2: Confidence contours of the Crossing hyperparameters using simulated future 
supernovae data (la CL in magenta, 2a CL in blue) and simulated future (SZ + X-ray) 
angular diameter distance data (la CL in green, 2a CL in red) after assuming the presence 
of systematics in the supernovae data. The inconsistency between the two data sets is 
clearly visible both in the left panel (at > 2a CL) and marginally so in the right panel 
(at la CL) . The systematics is reflected in a redshift dependent increase in the distance 
modulus (1.5) and is described by (2.7) with a = 0.01 (left panel) and a = 0.002 (right 
panel) . 



(right panel) are shown superimposed. 1 It is clearly difficult to see any visual 
difference between the left and right panels of this figure, whereas such a dif- 
ference is discernable when we compare fig. 2 (right) with fig. 1. We believe this 
reflects the strength of our method in discerning the presence of small amounts 
of systematics in different data sets. 

Note that the analysis thus far assumes that the value of Ho is known perfectly. 
In reality there will be uncertainties in Hq from observations carried out in the local 
universe. In fig. 4 we show the results of a comparison between SNIa and Cluster data 
after assuming a = 0.002 and allowing the Hubble parameter to have a flat prior 
in the range 68 < Hq < 72. This figure shows that an uncertainty in Hq does not 
seem to affect the detected inconsistency between the two data sets as it inflates the 
contour volumes in a direction which is perpendicular to that of overlap (between 
the contours). This is good news since we are still able to test for data systematics 
even without a precise knowledge of H , thereby mimicking the current observational 
situation. 

1 Wc have transformed the angular diameter distance data to the distance modulus by error 
propagation after assuming that we know the correct value of the Hubble parameter. 
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Figure 3: The distance modulus for the SNIa and cluster data is compared in these figures. 
The distance modulus for cluster data has been obtained after applying the cosmic duality 
relation (1.4). The left panel alludes to data with no systematics. The systematic errors in 
the right panel are described by a = 0.002 in (2.7). It clearly seems difficult to distinguish 
between the two panels by eye, whereas the crossing statistic shown in the right panel of 
the previous figure accomplishes this. 



As a final check, we test our method using a different fiducial model in order to 
make sure that it can be applied to different models of dark energy. We simulate 
the data for dark energy with w = —0.8 and our results are presented in fig. 5. 
This figure assumes systematic magnitude shift errors for supernovae similar to the 
previous case, namely a = 0.002. Again the confidence contours corresponding 
to the SNIa and Cluster data sets are well distinguished, demonstrating that our 
approach in identifying the presence of systematics is independent of the background 
cosmological model. 



3. Conclusion 



In this paper we present a robust and easy to use method of comparing different 
cosmological data sets in order to search for possible systematics in them. It has been 
shown earlier that combining the smoothing method and Crossings Statistic results 
in an approach which can be easily used to reconstruct the expansion history of the 
universe and the properties of dark energy without setting any priors on cosmological 
parameters [12]. In this paper we have used the idea of Bayesian interpretation of 
Crossing statistic [2] to test the consistency of two important cosmological data sets, 
namely the luminosity distance to type la supernovae, and the angular diameter 
distance to galaxy clusters, in order to search for possible systematics in either of the 
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Figure 4: Confidence contours of the Crossing hyperparameters using simulated future 
supernovae data (la CL in magenta, 2a CL in blue) and simulated future SZ angular diam- 
eter distance data (la CL in green, 2a CL in red) assuming the presence of small systematic 
errors in supernovae data, namely a = 0.002 in (2.7). We assume no precise knowledge 
of Hq choosing instead a flat prior on 0.68 < ho < 0.72 (ho = Ho/100 km/sec/Mpc). 
Interestingly the uncertainty in Ho inflates the cluster contour in a direction which is or- 
thogonal to that of the SNIa contour and therefore does not affect our consistency check. 
Consequently one can still pick out the presence of systematics at roughly the la level even 
if the value of Ho is not precisely known and the systematic uncertainties in the data are 
exceedingly small. Note that we have not assumed any particular cosmological model for 
this comparison and analysis. 

data sets. We have shown that our method can identify the presence of systematics 
(such as a small magnitude shift in supernovae data) in a model independent manner 
and even in the absence of a precise knowledge of the Hubble constant, Hq. The 
method developed by us succeeds in discerning the presence of systematics at the 
sub-percent level in data sets expected to become available in the near future. 
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